Dit-Yan Yeung

DivLogicEval: A Framework for Benchmarking Logical Reasoning Evaluation in Large Language Models

Sep 19, 2025

ECCV 2024 W-CODA: 1st Workshop on Multimodal Perception and Comprehension of Corner Cases in Autonomous Driving

Jul 02, 2025

Learning 3D Persistent Embodied World Models

May 05, 2025

CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback

Apr 28, 2025

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Feb 13, 2025

Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task

Feb 11, 2025

G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o

Dec 19, 2024

SG-LRA: Self-Generating Automatic Scoliosis Cobb Angle Measurement with Low-Rank Approximation

Nov 19, 2024

Fourier Amplitude and Correlation Loss: Beyond Using L2 Loss for Skillful Precipitation Nowcasting

Oct 30, 2024

Unified Triplet-Level Hallucination Evaluation for Large Vision-Language Models

Oct 30, 2024